Similarity Group-by Operators for Multi-Dimensional Relational Data
نویسندگان
چکیده
منابع مشابه
Similarity Group-by Operators for Multi-dimensional Relational Data (Extended Abstract)
The SQL group-by operator plays an important role in summarizing and aggregating large datasets in a data analytics stack. The Similarity SQL-based Group-By operator (SGB, for short) extends the semantics of the standard SQL Group-by by grouping data with similar but not necessarily equal values. While existing similarity-based grouping operators efficiently realize these approximate semantics,...
متن کاملFast similarity join for multi-dimensional data
To appear in Information Systems Journal, Elsevier, 2005 The efficient processing of multidimensional similarity joins is important for a large class of applications. The dimensionality of the data for these applications ranges from low to high. Most existing methods have focused on the execution of high-dimensional joins over large amounts of disk-based data. The increasing sizes of main memor...
متن کاملThe similarity-aware relational database set operators
Identifying similarities in large datasets is an essential operation in several applications such as bioinformatics, pattern recognition, and data integration. To make a relational database management system similarity-aware, the core relational operators have to be extended. While similarity-awareness has been introduced in database engines for relational operators such as joins and group-by, ...
متن کاملDiscovering Multi-relational Latent Attributes by Visual Similarity Networks
The key problems in visual object classification are: learning discriminative feature to distinguish between two or more visually similar categories ( e.g. dogs and cats), modeling the variation of visual appearance within instances of the same class (e.g. Dalmatian and Chihuahua in the same category of dogs), and tolerate imaging distortion (3D pose). These account to within and between class ...
متن کاملSimultaneous Approximation Terms for Multi-dimensional Summation-by-Parts Operators
This paper continues our effort to generalize summation-by-parts (SBP) finite-difference methods beyond tensor-products in multiple dimensions. In this work, we focus on the accurate and stable coupling of elements in the context of discontinuous solution spaces. We show how penalty terms — simultaneous approximation terms (SATs) — can be adapted to discretizations based on multi-dimensional SB...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Knowledge and Data Engineering
سال: 2016
ISSN: 1041-4347
DOI: 10.1109/tkde.2015.2480400